Pictorial Recognition of Objects
Abstract
This paper describes an efficient approach to pose-invariant pictorial object recognition employing spectral signatures of image patches that correspond to object surfaces which are roughly planar. Based on Singular Value Decomposition (SVD), the affine transform is decomposed into slant, tilt, swing, scale and 2D translation. Unlike previous log-polar representations, which were not invariant to slant (i.e. foreshortening only in one direction), our log-log sampling configuration in the frequency domain yields complete affine invariance. The images are preprocessed by a novel model-based segmentation scheme that detects and segments objects that are affine-similar to members of a model set of basic geometric shapes. The segmented objects are then recognized by their signatures using multi-dimensional indexing in a pictorial dataset represented in the frequency domain. Experimental results with a dataset of 26 models show 100% recognition rates in a wide range of 3D pose parameters and imaging degradations: 0°-360° of swing and tilt, 0°-82° of slant (more than 1:7 foreshortening), more than 3 octaves in scale change, window-limited translation, high noise levels (0 dB) and significantly reduced resolution (1:5).
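As a rough illustration of the SVD step mentioned in the abstract, the following minimal sketch factors the 2x2 linear part of an affine transform into a scale, two rotations and a foreshortening term. It is not the authors' implementation: the function names `rotation` and `decompose_affine` are hypothetical, the assignment of the left rotation to "swing" and the right rotation to "tilt" is an assumption, and the 2D translation is left out because it is not part of the 2x2 matrix. The relation foreshortening = cos(slant) is consistent with the abstract's figures, since cos 82° is roughly 1/7.

```python
import numpy as np

def rotation(theta):
    """2x2 counter-clockwise rotation matrix for an angle in radians."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s], [s, c]])

def decompose_affine(A):
    """Factor a 2x2 matrix A with det(A) > 0 as
    scale * R(swing) @ diag(1, cos(slant)) @ R(tilt).

    Naming the left angle 'swing' and the right angle 'tilt' is an
    assumption made for this sketch, not taken from the paper.
    """
    A = np.asarray(A, dtype=float)
    if np.linalg.det(A) <= 0:
        raise ValueError("expected a matrix with positive determinant")
    U, S, Vt = np.linalg.svd(A)  # A = U @ diag(S) @ Vt, with S[0] >= S[1]
    # U and Vt may both be reflections; flipping one singular-vector pair
    # turns both into proper rotations without changing the product.
    if np.linalg.det(U) < 0:
        U[:, 1] *= -1
        Vt[1, :] *= -1
    scale = S[0]
    # Ratio of singular values is the foreshortening along one direction.
    slant = np.arccos(np.clip(S[1] / S[0], 0.0, 1.0))
    swing = np.arctan2(U[1, 0], U[0, 0])
    tilt = np.arctan2(Vt[1, 0], Vt[0, 0])
    return scale, swing, slant, tilt

if __name__ == "__main__":
    # Synthetic transform: scale 2, slant of 1 radian, arbitrary rotations.
    A = 2.0 * rotation(0.3) @ np.diag([1.0, np.cos(1.0)]) @ rotation(-0.5)
    scale, swing, slant, tilt = decompose_affine(A)
    print(scale, np.degrees(slant))
```

The two rotation angles are only determined up to a joint shift by 180°, so a check on the result should compare the reconstructed matrix rather than the individual angles.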
Similar resources
The Effect of Pictorial Flashcards on the Sight Word Recognition in Kindergartens
It was a quasi-experimental study because it involved training participants in two classes, each containing pre-primary students about 5 to 6 years old. To this end, fifty students studying at Misagh School in Tabriz participated in the study. To ensure their homogeneity, the researcher administered a pre-test. Based on the results, 40 students were selected as the ...
Episodic encoding and recognition of pictures and words: role of the human medial temporal lobes.
In the present PET study, we examined brain activity related to processing of pictures and printed words in episodic memory. Our goal was to determine how the perceptual format of objects (verbal versus pictorial) is reflected in the neural organization of episodic memory for common objects. We investigated this issue in relation to encoding and recognition with a particular focus on medial tem...
2D but not 3D: pictorial-depth deficits in a case of visual agnosia.
Patients with visual agnosia exhibit acquired impairments in visual object recognition that may or may not involve deficits in low-level perceptual abilities. Here we report a case (patient DM) who presented with object-recognition deficits after a head injury. He still appears able to extract 2D information from the visual world in a relatively intact manner; but his ability to extract pictoria...
Pictorial Recognition Using Affine-Invariant Spectral Signatures
This paper describes an efficient approach to pose invariant object recognition employing pictorial recognition of image patches. A complete affine invariance is achieved by a representation which is based on a new sampling configuration in the frequency domain. Employing Singular Value Decomposition (SVD), the affine transform is decomposed into slant, tilt, swing, scale and 2D translation. Fr...
Extending Pictorial Structures for Object Recognition
The goal of this paper is to recognize various deformable objects from images. To this end we extend the class of generative probabilistic models known as pictorial structures. This class of models is particularly suited to represent articulated structures, and has previously been used by Felzenszwalb and Huttenlocher for pose estimation of humans. We extend pictorial structures in three ways: ...